Short: Sorts unique words in texts by frequency
Author: blodskam@hotmail.com (Erik Spåre)
Uploader: blodskam@hotmail.com (Erik Spåre)
Version: 1.3
Type: text/misc
Requires: OS 2.04 or above
**********************************************************************
* TextInfo 1.3 by Erik Spåre (Parsec/Phuture 303)             990107 *
* Filematching routines by Anders Vedmar (Axehandle)                 *
**********************************************************************
TextInfo's task is to count all the various _unique_ words in texts
and list them. "Mine your mine" = 3 words, 2 unique (mine, your).
Read on. If you find the following facts interesting, then this may
be a program for you.
! A Tale of Two Cities by Dickens has almost twice as many unique
words as the Koran, despite the fact that the Koran is bigger (Two
Cities 8041/138384, Koran 4300/152164). Plato's Republic (translated,
like the Koran) has about 44 % more than the Koran (6199/127005).
! Moby Dick has 13403/213486 words; that is 27 % more unique words
than the Bible's 10560/812394, even though Moby Dick's size is only
about one quarter of the Bible's!
! A friend (greetings Théonore) told me that he had heard that
there was a word in the Bible that occurred 666 times, and it
was the name of the Beast. There is no such word in the King
James Bible...
! You only need to know the meaning of 48 words to understand
half of the words written in the Bible... (37 in the Koran,
44 in the Republic, 64 in Two Cities and 87 in Moby Dick).
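The figures above (unique/total counts, and how many words cover half of a text) can be reproduced roughly with a few lines of Python. This is only a sketch and not TextInfo's own code. The function names are made up for illustration. The word definition (runs of letters, optionally joined by apostrophes, compared case-insensitively) is an assumption based on the "Mine your mine" example, so counts for real texts may differ from TextInfo's.

```python
import re
from collections import Counter


def word_counts(text):
    """Count each word in text, ignoring case."""
    # Assumption: a word is a run of letters, optionally joined by
    # apostrophes (e.g. "don't"). TextInfo's real rules may differ.
    words = re.findall(r"[^\W\d_]+(?:'[^\W\d_]+)*", text.lower())
    return Counter(words)


def summary(text):
    """Return (unique, total) in the same order as the readme's figures."""
    counts = word_counts(text)
    return len(counts), sum(counts.values())


def words_for_coverage(counts, fraction=0.5):
    """Number of most frequent words needed to cover `fraction` of the text."""
    total = sum(counts.values())
    covered = 0
    # most_common() lists the words sorted by descending frequency.
    for n, (_, freq) in enumerate(counts.most_common(), start=1):
        covered += freq
        if covered >= fraction * total:
            return n
    return 0  # empty text


print(summary("Mine your mine"))                          # (2, 3)
print(words_for_coverage(word_counts("Mine your mine")))  # 1
```

Feeding a whole book to summary() gives the unique/total pair, and words_for_coverage() gives the "half of the words" figure for the same text.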
REQUIREMENTS
OS 2.04+ and a mind that is more interested in how many grains of
sand the average beach contains than in the latest sports results or
soap operas.
HISTORY
v1.3 (990107)
** When I wanted to include the output in an email, the mailer
of course couldn't handle the tabs. I tried to circumvent this
by specifying -t32,1, hoping that TextInfo would make the output
32 characters wide. When that didn't work, I vaguely remembered
being too lazy to accept more than 9 tabs (one digit only). So
I fixed this, and made sure that spaces are printed instead of
tabs if the tab size is set to 1.
That is all for this release; still no requests for more features.
Perhaps the program is perfect now? :)
============================= Archive contents =============================

Original  Packed Ratio    Date     Time    Name
-------- ------- ----- --------- --------  -------------
   12860    5791 54.9% 07-Jan-99 01:21:04 +TextInfo
   17427    7486 57.0% 07-Jan-99 01:22:52 +TextInfo.doc
    2418    1193 50.6% 07-Jan-99 01:23:10 +TextInfo.readme
-------- ------- ----- --------- --------
   32705   14470 55.7% 07-Jan-99 19:24:06   3 files